CLAIMS 



What is claimed: 

1 . (Currently Amended) A method for recognizing speech, the method comprising: 
recognizing a sequence of words; 

processing the sequence of words using word agglomeration that replaces the 
sequence of words with an associated /z-tuple sequence, the w-tuple sequence comprising 
word n-tuples that are all strings of n consecutive words in the sequence of words, 
wherein the w-tuple sequence is represented by a vector representation in a semantic 
space; and 

classifying the processed sequence of words as a predetermined command based 
on the vector representation of the «-tuple sequence in the semantic space. 

2. (Original) The method of claim 1, further comprising performing an action 
corresponding to the predetermined command. 

3-4 (Canceled) 

5. (Previously Presented) The method of claim 1, wherein classifying comprises 
semantically inferring the predetermined command from the associated word //-tuple 
sequence. 

6. (Original) The method of claim 1, wherein classifying comprises semantically 
inferring the predetermined command from the processed sequence of words. 

7. (Original) The method of claim 6, wherein semantically inferring comprises 
determining a correlation between the processed sequence of words and at least one 
semantic anchor. 



09/688,010 



-2- 



4860P2564 



8. (Previously Presented) The method of claim 7, wherein the correlation is a 
distance between the vector corresponding to the processed sequence of words and a 
vector corresponding to the at least one semantic anchor. 

9. (Original) The method of claim 8, wherein semantically inferring further 
comprises selecting the predetermined command from the semantic anchor vector having 
the shortest distance. 

1 0. (Original) The method of claim 9, wherein the semantic anchor represents a one 
of a plurality of predetermined commands. 

1 1 . (Original) The method of claim 7, wherein the at least one semantic anchor is 
derived from a training data. 

12. (Original) The method of claim 6, wherein semantically inferring the 
predetermined command depends on the order of the words in the processed sequence of 
words. 

13. (Previously Presented) The method of claim 1, wherein the classifying comprises: 
generating the vector representation of the processed sequence of words; and 
comparing the vector representation to a plurality of semantic anchors, wherein 

each of the plurality of semantic anchors corresponds to one of a plurality of voice 
commands. 

14. (Previously amended) The method of claim 1 3, wherein the classifying further 
comprises: 

selecting a semantic anchor from the plurality of semantic anchors that is most 
similar to the vector representation; and 

classifying the processed sequence of words as the command that corresponds to 
the selected semantic anchor. 
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15. (Previously Presented) The method of claim 14, wherein the selecting comprises: 
for each of the plurality of semantic anchors, identifying the similarity between 

the vector representation and the semantic anchor by calculating the cosine of the angle 
between the product of the vector representation and a diagonal matrix of singular values 
and the product of the semantic anchor and the diagonal matrix of singular values; and 

selecting the semantic anchor from the plurality of semantic anchors that 
corresponds to the largest cosine value as the semantic anchor that is most similar to the 
vector representation. 

16. (Original) The method of claim 13, wherein the vector representation is an 
indication of how frequently each of a plurality of word w -tuples occurs within the 
processed sequence of words. 

1 7. (Original) The method of claim 1 6, wherein each of the plurality of semantic 
anchors is an indication of how frequently each of the plurality of word « -tuples occurs 
with respect to the corresponding command. 

18. (Original) The method of claim 13, wherein each of the plurality of semantic 
anchors represents a plurality of different ways of speaking the corresponding command. 

19. (Original) The method of claim 13, wherein each of the plurality of semantic 
anchors represents a plurality of different commands having the same words, but in a 
different order. 

20. (Currently Amended) A machine-readable medium having stored thereon a 
plurality of instructions that, when executed by a processor, cause the processor to 
recognize a voice command by: 

recognizing a sequence of words; 

processing the sequence of words using word agglomeration that replaces the 
sequence of words with an associated w-tuple sequence, the w-tuple sequence comprising 
word n-tuples that are all strings of n consecutive words in the sequence of words, 
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wherein the n-tuple sequence is represented by a vector representation in a semantic 
space; and 

classifying the processed sequence of words as a predetermined command based 
on the vector representation of the w-tuple sequence in the semantic space. 

21 . (Original) The machine-readable medium of claim 20, wherein the plurality of 
instructions further cause the processor to perform an action corresponding to the 
predetermined command. 

22-23 (Canceled) 

24. (Previously Presented) The machine-readable medium of claim 20, wherein the 
instructions for classifying comprises semantically inferring the predetermined command 
from the associated word w-tuple sequence. 

25. (Previously Presented) The machine-readable medium of claim 20, wherein the 
instructions for semantically inferring comprises determining a correlation between a 
semantic representation of the associated word w-tuple sequence and at least one semantic 
anchor. 

26. (Original) The machine-readable medium of claim 25, wherein the instructions for 
determining a correlation comprise determining a distance between the vector 
corresponding to the semantic representation and a vector corresponding to the at least 
one semantic anchor. 

27. (Previously Presented) The machine-readable medium of claim 26, wherein the 
instructions for semantically inferring further comprises selecting the predetermined 
command from the semantic anchor vector having the shortest distance to the vector 
corresponding to the semantic representation. 
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28. (Original) The machine-readable medium of claim 25, wherein the at least one 
semantic anchor represents a one of a plurality of predetermined commands. 

29. (Original) The machine-readable medium of claim 25, wherein the at least one 
semantic anchor is derived from a training data. 

30. (Original) The machine-readable medium of claim 25, wherein the performance 
of the instructions for semantically inferring the predetermined command depends on the 
order of the words in the processed sequence of words. 

3 1 . (Currently Amended) An apparatus for recognizing a voice command, the 
apparatus comprising: 

a speech recognizer to recognize a sequence of words received as the voice 
command; 

a processor to process the sequence of words using word agglomeration that 
replaces the sequence of words with an w-tuple sequence, the n-tuple sequence 
comprising word n-tuples that are all strings of n consecutive words in the sequence of 
words, wherein the w-tuple sequence is represented by a vector representation in a 
semantic space; and 

a semantic classifier, coupled to the processor, to semantically infer from the 
representation of the w-tuple sequence in the semantic space which of a plurality of 
predetermined commands correlate to the voice command. 

32. (Original) The apparatus of claim 3 1 , further comprising: 

an action generator, coupled to the semantic classifier, to use the vector 
representation to determine an action to be performed. 

33. (Original) The apparatus of claim 31, wherein the semantic classifier is further to 
compare the vector representation to a plurality of semantic anchors, wherein each of the 
plurality of semantic anchors corresponds to a one of the plurality of predetermined 
commands. 
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34. (Previously Presented) The apparatus of claim 33, wherein the semantic classifier 
is further to identify a semantic anchor from the plurality of semantic anchors that is most 
similar to the vector representation, and to classify the vector representation as the one of 
the plurality of predetermined commands that corresponds to the identified semantic 
anchor. 

35. (Currently Amended) An apparatus for recognizing a voice command, the 
apparatus comprising: 

means for recognizing a sequence of words received as the voice command; 

means for processing the sequence of words using word agglomeration that 
replaces the sequence of words with an w-tuple sequence, the «-tuple sequence 
comprising word n-tuples that are all strings of n consecutive words in the sequence of 
words, wherein the n-tuple sequence is represented by a vector representation 

means, coupled to the means for processing, for semantically inferring from the 
vector representation of the «-tuple sequence in the semantic space which of a plurality of 
predetermined commands correlate to the voice command. 

36. (Original) The apparatus of claim 35, further comprising means, coupled to the 
means for semantically inferring, for using the vector representation to determine an 
action to be performed. 

37. (Original) The apparatus of claim 35, wherein the means for semantically 
inferring further comprises: 

means for comparing the vector representation to a plurality of semantic anchors, 
wherein each of the plurality of semantic anchors corresponds to one of a plurality of 
predetermined commands. 

38. (Original) The apparatus of claim 37, wherein the means for semantically 
inferring further comprises: 
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means for identifying a semantic anchor of the plurality of semantic anchors that 
is most similar to the vector representation; and 

means for classifying the vector representation as the one of the plurality of 
predetermined commands that corresponds to the identified semantic anchor. 

39-45 (Canceled) 
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